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receptors for diverse extracellular signals to different effectors. Ras directly activates the 
downstream kinase Raf which in turn induces the mitogen activated protein kinase (MAPK) 
cascade. 

5 The Ras is an example of a guanine nucleotide exchange factor (GEF). A mutation in a GEF 
such as Ras has been implicated in development of a range of cancers and tumours. There 
is a need, therefore, to identify new GEFs and to develop therapeutic and diagnostic protocols 
based on modulating function of the GEF singalling pathways. 

10 Accordingly, one aspect of the present invention contemplates an isolated nucleic acid 
molecule comprising a sequence of nucleotides encoding or complementary to a sequence 
encoding an amino acid sequence having homology to a guanine nucleotide exchange factor 
(GEF) or a derivative of said gene regulator. 

15 More particularly, the present invention is directed to an isolated nucleic acid molecule 
comprising a sequence of nucleotides or a complementary form thereof selected from: 

(i) a nucleotide sequence set forth in SEQ ID NO: 1; 

(ii) a nucleotide sequence encoding an amino acid sequence set forth in SEQ ID NO: 2; 
20 (iii) a nucleotide sequence having at least about 40% similarity to the nucleotide 

sequence of (i) or (ii); and 
(iv) a nucleotide sequence capable of hybridizing under low stringency conditions to 

the nucleotide sequence set forth in (i), (ii) or (iii). 

25 Preferably, the percentage similarity is at least about 50%. More preferably, the percentage 
similarity is at least about 60%. 



Reference herein to a low stringency at 42 °C includes and encompasses from at least about 1% 
v/v to at least about 15% v/v formamide and from at least about 1M to at least about 2M salt for 
30 hybridisation, and at least about 1M to at least about 2M salt for washing conditions. Alternative 
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stringency conditions may be applied where necessary, such as medium stringency, which 
includes and encompasses from at least about 16% v/v to at least about 30% v/v formamide and 
from at least about 0.5M to at least about 0.9M salt for hybridisation, and at least about 0.5M 
to at least about 0.9M salt for washing conditions, or high stringency, which includes and 
5 encompasses from at least about 3 1% v/v to at least about 50% v/v formamide and from at least 
about 0.01M to at least about 0. 15M salt for hybridisation, and at least about 0.01M to at least 
about 0.1 5M salt for washing conditions. 

The term "similarity" as used herein includes exact identity between compared sequences at the 
10 nucleotide or amino acid level. Where there is non-identity at the nucleotide level, "similarity" 
includes differences between sequences which result in different amino acids that are nevertheless 
related to each other at the structural, functional, biochemical and/or conformational levels. 
Where there is non-identity at the amino acid level, "similarity" includes amino acids that are 
nevertheless related to each other at the structural, functional, biochemical and/or conformational 
15 levels. 

The present invention extends to nucleic acid molecules with percentage similarities of 
approximately 65%, 70%, 75%, 80%, 85%, 90% or 95% or above or a percentage in between. 

20 The nucleic acid molecule of the present invention is hereinafter referred to as constituting the 
"mcgT % gene. The protein encoded by mcg7\s referred to herein as "MCG7" and is involved 
in signal transduction. 

The present invention extends to the naturally occurring genomic mcg7 nucleotide sequence 
25 or corresponding cDNA sequence or to derivatives thereof. Derivatives contemplated in the 
present invention include fragments, parts, portions, mutants, homologues and analogues of 
MCG7 or the corresponding genetic sequence. Derivatives also include single or multiple 
amino acid substitutions, deletions and/or additions to MCG7 or single or multiple nucleotide 
substitutions, deletions and/or additions to meg 7. Derivatives also includes modifications to 
30 nucleotide bases or amino acid residues to, for example, alter glycosylation sites or amino 
acid side chains. "Additions" to the amino acid or nucleotide sequences include fusions with 
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other peptides, polypeptides or proteins or fusions to nucleotide sequences. Reference herein 
to "MCG7" or "mcg7" includes references to all derivatives thereof including functional 
derivatives and immunologically interactive derivatives of MCG7. 

5 The mcg7 of the present invention is particularly exemplified herein from humans and in 
particular from human chromosome llql3. 

The present invention also extends, however, to a range of homologues from, for example, 
primates, livestock animals (eg. sheep, cows, horses, donkeys, pigs), companion animals (eg. 
10 dogs, cats) laboratory test animals (eg. rabbits, mice, rats, guinea pigs), birds (eg. chickens, 
ducks, geese, parrot), insects, nematodes, eukaryotic microorganisms and captive wild 
animals (eg. deer, foxes, kangaroos). Reference herein to mcg7 or MCG7 includes reference 
to these molecules of human origin as well as novel forms of non-human origin. 

15 The nucleic acid molecules of the present invention may be DNA or RNA, When the nucleic 
acid molecule is in DNA form, it may be genomic DNA or cDNA. RNA forms of the nucleic 
acid molecules of the present invention are generally mRNA. 

Although the nucleic acid molecules of the present invention are generally in isolated form, they 
20 may be integrated into or ligated to or otherwise fused or associated with other genetic 
molecules such as vector molecules and in particular expression vector molecules. Vectors and 
expression vectors are generally capable of replication and, if applicable, expression in one or 
both of a prokaryotic cell or a eukaryotic cell. Preferably, prokaryotic cells include E. coli, 
Bacillus sp and Pseudomonas sp. Preferred eukaryotic cells include yeast, fungal, mammalian 
25 and insect cells. 

Accordingly, another aspect of the present invention contemplates a genetic construct comprising 
a vector portion and an animal, more particularly a mammalian and even more particularly a 
human mcg7 gene portion, which mcg7 gene portion is capable of encoding an mcg7 polypeptide 
30 or a functional or immunologically interactive derivative thereof. 
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Preferably, the mcg7 gene portion of the genetic construct is operably linked to a promoter on 
the vector such that said promoter is capable of directing expression of said mcg7 gene portion 
in an appropriate cell. 

5 In addition, the mcg7 gene portion of the genetic construct may comprise all or part of the gene 
fused to another genetic sequence such as a nucleotide sequence encoding glutathione- S- 
transferase or part thereof 

The present invention extends to such genetic constructs and to prokaryotic or eukaryotic cells 
10 comprising same. 

It is proposed in accordance with the present invention that MCG7 is a GEF involved in 
signal transduction. Mutations in mcg7 or MCG7 may result in defective control of cell 
proliferation leading to the development of or a propensity to develop various types of cancer. 

15 

A deletion or aberration in the mcg7 gene may also be important in the detection of cancer 
or a propensity to develop cancer. An aberration may be a homozygous mutation or a 
heterozygous mutation. The detection may occur at the foetal or post-natal level. Detection 
may also be at the germline or somatic cell level. Furthermore, a risk of developing cancer 
20 may be determined by assaying for aberrations in the parents of a subject under investigation. 

According to this aspect of the present invention, there is contemplated a method of detecting 
a condition caused or facilitated by an aberration in mcg7, said method comprising 
determining the presence of a single or multiple nucleotide substitution, deletion and/or 
25 addition or other aberration to one or both alleles of said mcgl wherein the presence of such 
a nucleotide substitution, deletion and/or addition or other aberration may be indicative of 
said condition or a propensity to develop said condition. 

The nucleotide substitutions, additions or deletions may be detected by any convenient means 
30 including nucleotide sequencing, restriction fragment length polymorphism (RFLP), 
polymerase chain reaction (PCR), oligonucleotide hybridization and single stranded 
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conformation polymorphism analysis (SSCP) amongst many others. An aberration includes 
modification to existing nucleotides such as to modify glycosylation signals amongst other 
effects. 

5 In an alternative method, aberrations in the mcg7 gene are detected by screening for mutations 
in MCG7. 

A mutation in MCG7 may be a single or multiple amino acid substitution, addition and/or 
deletion. The mutation in mcg7 may also result in either no translation product being 
10 produced or a product in truncated form. A mutation may also.be an altered glycosylation 
pattern or the introduction of side chain modifications to amino acid residues. 

According to this aspect of the present invention, there is provided a method of detecting a 
condition caused or facilitated by an aberration in mcg7, said method comprising screening 
15 for a single or multiple amino acid substitution, deletion and/or addition to MCG7 wherein 
the presence of such a mutation is indicative of or a propensity to develop said condition. 

A particularly convenient means of detecting a mutation in MCG7 is by use of antibodies. 

20 Accordingly another aspect of the present invention is directed to antibodies to MCG7 and its 
derivatives. Such antibodies may be monoclonal or polyclonal and may be selected from 
naturally occurring antibodies to MCG7 or may be specifically raised to MCG7 or derivatives 
thereof In the case of the latter, MCG7 or its derivatives may first need to be associated with 
a carrier molecule. The antibodies to MCG7 of the present invention are particularly useful as 

25 diagnostic agents. 

For example, antibodies to MCG7 and its derivatives can be used to screen for wild-type MCG7 
or for mutated MCG7 molecules. The latter may occur, for example, during or prior to certain 
cancer development. A differential binding assay is also particularly useful. Techniques for such 
30 assays are well known in the art and include, for example, sandwich assays and ELISA. 
Knowledge of normal MCG7 levels or the presence of wild-type MCG7 may be important for 
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diagnosis of certain cancers or a predisposition for development of cancers or for monitoring 
certain therapeutic protocols. 

As stated above antibodies to MCG7 of the present invention may be monoclonal or polyclonal 
5 or may be fragments of antibodies such as Fab fragments. Furthermore, the present invention 
extends to recombinant and synthetic antibodies and to antibody hybrids. A "synthetic antibody" 
is considered herein to include fragments and hybrids of antibodies. 

For example, specific antibodies can be used to screen for wild-type MCG7 molecule or specific 
10 mutant molecules such as molecules having a certain deletion. This would be important, for 
example, as a means for screening for levels of MCG7 in a cell extract or other biological fluid 
or purifying MCG7 made by recombinant means from culture supernatant fluid or purified from 
a cell extract. Techniques for the assays contemplated herein are known in the art and include, 
for example, sandwich assays and ELISA. 

15 

It is within the scope of this invention to include any second antibodies (monoclonal, polyclonal 
or fragments of antibodies or synthetic antibodies) directed to the first mentioned antibodies 
discussed above. Both the first and second antibodies may be used in detection assays or a first 
antibody may be used with a commercially available anti-immunoglobulin antibody. An antibody 
20 as contemplated herein includes any antibody specific to any region of wild-type MCG7 or to a 
specific mutant phenotype or to a deleted or otherwise altered region. 

Both polyclonal and monoclonal antibodies are obtainable by immunization of a suitable animal 
or bird with MCG7 or its derivatives and either type is utilizable for immunoassays. The 

25 methods of obtaining both types of sera are well known in the art. Polyclonal sera are less 
preferred but are relatively easily prepared by injection of a suitable laboratory animal or bird 
with an effective amount of MCG7 or antigenic parts thereof or derivatives thereof, collecting 
serum from the animal or bird, and isolating specific sera by any of the known immunoadsorbent 
techniques. Although antibodies produced by this method are utilizable in virtually any type of 

30 immunoassay, they are generally less favoured because of the potential heterogeneity of the 
product. 
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The use of monoclonal antibodies in an immunoassay is particularly preferred because of the 
ability to produce them in large quantities and the homogeneity of the product. The preparation 
of hybridoma cell lines for monoclonal antibody production derived by fusing an immortal cell 
line and lymphocytes sensitized against the immunogenic preparation can be done by techniques 
5 which are well known to those who are skilled in the art. 

Another aspect of the present invention contemplates a method for detecting MCG7 or a 
derivative thereof in a biological sample said method comprising contacting said biological 
sample with an antibody specific for MCG7 or its derivatives or homologies for a time and under 
10 conditions sufficient for an antibody-MCG7 complex to form, and then detecting said complex. 

Preferably, the biological sample is a cell extract from a human or other animal or a bird. 

The presence of MCG7 may be accomplished in a number of ways such as by Western blotting 
1 5 and ELISA procedures. A wide range of immunoassay techniques are available as can be seen 
by reference to US Patent Nos. 4,016,043, 4, 424,279 and 4,018,653. These include both single- 
site and two-site or "sandwich" assays of the non-competitive types, as well as traditional 
competitive binding assays. These assays also include direct binding of a labelled antibody to a 
target. 

20 

Sandwich assays are among the most useful and commonly used assays and are favoured for use 
in the present invention. A number of variations of the sandwich assay technique exist, and all 
are intended to be encompassed by the present invention. Briefly, in a typical forward assay, an 
unlabelled antibody is immobilized on a solid substrate and the sample to be tested brought into 

25 contact with the bound molecule. After a suitable period of incubation, for a period of time 
sufficient to allow formation of an antibody-antigen complex, a second antibody specific to the 
antigen, labelled with a reporter molecule capable of producing a detectable signal is then added 
and incubated, allowing time sufficient for the formation of another complex of antibody-antigen- 
labelled antibody. Any unreacted material is washed away, and the presence of the antigen is 

30 determined by observation of a signal produced by the reporter molecule. The results may either 
be qualitative, by simple observation of the visible signal, or may be quantitated by comparing 
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with a control sample containing known amounts of hapten. Variations on the forward assay 
include a simultaneous assay, in which both sample and labelled antibody are added 
simultaneously to the bound antibody. These techniques are well known to those skilled in the 
art, including any minor variations as will be readily apparent. In accordance with the present 
5 invention the sample is one which might contain MCG7 including cell extract or, tissue biopsy. 
The sample is, therefore, generally a biological sample comprising biological fluid but also 
extends to fermentation fluid and supernatant fluid such as from a cell culture. 

In the typical forward sandwich assay, a first antibody having specificity for the MCG7 or an 
10 antigenic part thereof or a derivative thereof or antigenic parts thereof, is either covalently or 
passively bound to a solid surface. The solid surface is typically glass or a polymer, the most 
commonly used polymers being cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride 
or polypropylene. The solid supports may be in the form of tubes, beads, discs of microplates, 
or any other surface suitable for conducting an immunoassay. The binding processes are well- 
1 5 known in the art and generally consist of cross-linking covalently binding or physically adsorbing, 
the polymer-antibody complex is washed in preparation for the test sample. An aliquot of the 
sample to be tested is then added to the solid phase complex and incubated for a period of time 
sufficient (e.g. 2-40 minutes) and under suitable conditions (e.g. 25 °C) to allow binding of any 
subunit present in the antibody. Following the incubation period, the antibody subunit solid 
20 phase is washed and dried and incubated with a second antibody specific for a portion of the 
hapten. The second antibody is linked to a reporter molecule which is used to indicate the 
binding of the second antibody to the hapten. 

An alternative method involves immobilizing the target molecules in the biological sample and 
25 then exposing the immobilized target to specific antibody which may or may not be labelled with 
a reporter molecule. Depending on the amount of target and the strength of the reporter 
molecule signal, a bound target may be detectable by direct labelling with the antibody. 
Alternatively, a second labelled antibody, specific to the first antibody is exposed to the target- 
first antibody complex to form a target-first antibody-second antibody tertiary complex. The 
30 complex is detected by the signal emitted by the reporter molecule. 
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By "reporter molecule" as used in the present specification, is meant a molecule which, by its 
chemical nature, provides an analytically identifiable signal which allows the detection of antigen- 
bound antibody. Detection may be either qualitative or quantitative. The most commonly used 
reporter molecules in this type of assay are either enzymes, fluorophores or radionuclide 
5 containing molecules (i.e. radioisotopes) and chemiluminescent molecules. 

In the case of an enzyme immunoassay, an enzyme is conjugated to the second antibody, 
generally by means of glutaraldehyde or periodate. As will be readily recognized, however, a 
wide variety of different conjugation techniques exist, which are readily available to the skilled 
artisan. Commonly used enzymes include horseradish peroxidase, glucose oxidase, beta- 

10 galactosidase and alkaline phosphatase, amongst others. The substrates to be used with the 
specific enzymes are generally chosen for the production, upon hydrolysis by the corresponding 
enzyme, of a detectable colour change. Examples of suitable enzymes include alkaline 
phosphatase and peroxidase. It is also possible to employ fluorogenic substrates, which yield a 
fluorescent product rather than the chromogenic substrates noted above. In all cases, the 

1 5 enzyme-labelled antibody is added to the first antibody hapten complex, allowed to bind, and 
then the excess reagent is washed away. A solution containing the appropriate substrate is then 
added to the complex of antibody-antigen-antibody. The substrate will react with the enzyme 
linked to the second antibody, giving a qualitative visual signal, which may be further quantitated, 
usually spectrophotometrically, to give an indication of the amount of hapten which was present 

20 in the sample. "Reporter molecule" also extends to use of cell agglutination or inhibition of 
agglutination such as red blood cells on latex beads, and the like. 

Alternately, fluorescent compounds, such as fluorescein and rhodamine, may be chemically 
coupled to antibodies without altering their binding capacity. When activated by illumination 

25 with light of a particular wavelength, the fluorochrome-labelled antibody adsorbs the light 
energy, inducing a state to excitability in the molecule, followed by emission of the light at a 
characteristic colour visually detectable with a light microscope. As in the EIA, the fluorescent 
labelled antibody is allowed to bind to the first antibody-hapten complex. After washing off the 
unbound reagent, the remaining tertiary complex is then exposed to the light of the appropriate 

30 wavelength the fluorescence observed indicates the presence of the hapten of interest. 
Immunofluorescence and EIA techniques are both very well established in the art and are 
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particularly preferred for the present method. However, other reporter molecules, such as 
radioisotope, chemiluminescent or bioluminescent molecules, may also be employed. 

As stated above, the present invention extends to genetic constructs capable of encoding 
5 MCG7 or functional derivatives thereof Such genetic constructs are also contemplated to be 
useful in modulating expression of specific genes in which mcgl is involved in tissue-specific 
or temporal regulation. 

Accordingly, another aspect of the present invention is directed to a genetic construct 
10 comprising a nucleotide sequence encoding a peptide, polypeptide or protein and meg 7 or a 
functional derivative or homologue thereof capable of modulating the expression of said 
nucleotide sequence. 
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The present invention is further described with reference to the following non-limiting figures 
and Examples. 

In the Figures: 

5 

Figure 1 is a representation showing similarity of MCG7 with GEFs of organisms. 

Figure 2(a) is a representation of the nucleotide sequence and corresponding amino acid 
sequence of meg 7. An exon is shown in the nucleotide sequence in lower case (nucleotides 
10 183-298). 

Figure 2(b) is a representation of the nucleotide sequence and corresponding amino acid 
sequence of meg 7 but without the exon shown in Fig. 2(a). The cDNA molecules of Fig. 
2(a) and Fig. 2(b) differ by the inclusion and exclusion of the exon shown in Figure 2(a) in 
15 lowercase. 

Figure 3 is a representation showing a comparison between MCG7 and a homologue from 
Caenorhabditis elegans using BEST FIT algorithm. This top codon is also present in a mouse 
EST and sequence alignment between human and mouse ESTs suggests this region represents 
20 the 5' UTR. Furhtermore, protein homology with the C. elegans protein (shown below) 
suggests the underlined ATG codon to represent the true initiation codon. 

In the figure, the following sequences are colour coded: 

25 orange 1 nematode DVDEEDEVEDIEF 

orange 3 human DVDGDGHISQEEF 
nematode DHDRDGFISQEEF 



30 
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orange 



4 



human 



DQNQDGCISREEM 



nematode 



DVDMDGQISKDEL 



pink 



2 



human 



HFVHVAEKWQLQNFNTLMAWGGLSHSSISRLKETH 



5 



nematode 



KFVHVAKHLRKINNFNTLMSVVGGITHSSVARLAKTY 



yellow 5 

human HNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVE 
nematode HNFHETTFLTPTTCNHCNKLLWGILRQGFKCKDCGLAVHSCCKSNAVAE 



Figure 4 is a representation of an alignment of human and murine mcgl nucleotide sequences. 

Figure 5 is a representation of further 5' nucleotide and corresponding amino acid sequence 
for mcgl , 



Figure 6 is a graphical representation of GDP release assay. □ Experiment #1 (mean of 
duplicates). 0 Experiment #2 (mean of duplicates). Exchange reaction contained 36pMols 
of GSTmcg7 (N-terminally truncated) and 1.6-12.8 pMols of recombinant N-Ras.GDP. 
Reaction time 6 mins. 
20 Estimated reaction constants: 



10 



15 



= 2.1/xM, = 37pMoI/6min/36pMol [Exptfl] 
K m = 1.5/xM, V inax = 30.3pMol/6 min/36pMol [Expt#2] 



P:\OPER\EJH\MCG7.PRV - 23/5?! 



- 14- 

EXAMPLE 1 

A human gene (designated mcgT) was identified and isolated from chromosome 1 lql3 which 
encodes a protein that bears striking homology with guanine nucleotide exchange factors 
5 (GEFs) from a wide variety of organisms (Fig. 1). 

EXAMPLE 2 

The composite mcgl cDNA sequence is at least 2.4kb in length and Figures 2(a) and 2(b) 
10 show a predicted amino acid sequence of 609 amino acids. Alternative start sites may yield 
a protein of 714 amino acids (Fig. 5). 

EXAMPLE 3 

15 A mcgl homologue from C. elegans has been identified, the product of which is highly 
conserved with that of MCG7 (Fig. 3). There are several salient features of the protein which 
have been highlighted in Fig. 3 - namely: a guanine nucleotide binding region (pink), a 
diacylglycerol binding region (yellow), and M EF-hand"-calcium binding regions (orange). 
In addition, there are several potential cAMP, protein kinase C, and casin kinase II 

20 phosphorylation sites, as well as a number of potential sites for glycosylation (not indicated). 

EXAMPLE 4 

A number of partial human and murine EST clones exist for mcgl , 

25 

EXAMPLE 5 

The best characterised GEFs are the family of ras oncoproteins, which play a pivotal role in 
signal transduction and when mutated are responsible for tumour development. A variety of 
30 therapeutic regimes for cancer treatment have been designed to specifically interfere with the 
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ras signalling pathways. There is potential, therefore that the product of mcgl could also be 
a target for such clinical strategies. 

EXAMPLE 6 

5 Initiation codons for meg 7. 

The nucleotide sequence for mcg7 cDNA was extended 5' with genomic DNA sequence from 
Genbank accession number AC000134 (positions 1-321) and analysed for additional coding 
sequence 5* to the putative initiation codon (nt 681-683). An additional in-frame ATG occurs 
at position nt 495-497 when the alternatively splice exon (position nt 504-609) is present. 
10 This closely matches the Kozak consensus. When this exon is absent, then the ATG is not 
in-frame and other possible initiation codons are absent resulting translation shown in lower 
case lettering. Further evidence that the initiation codon at position nt 681-683 is the true 
imitation site is given below in Figure 4. 

15 Alignment of human and murine mcg7 cDNA sequences is shown in Figure 4. The murin 
sequence represents a composite of 2 cDNA sequences from the expressed sequence tag 
database (accession numbers W71787 and AA237373). The putative initiation codon is at 
position nt 360-362. Both murine ESTs appear to have an upstream in-frame stop codon at 
position nt 326-328, downstream of the differentially spliced exon. Nucleotide differences 

20 between human and murine sequences are shown in lower case lettering and identical residues 
are indicated with asterisks. 

The data are shown in Figures 4 and 5 and strongly suggest that the ATG codon at position 
nt 360-362 encodes the N- terminus of MCG7. 

25 

EXAMPLE 7 

Figure 6 shows data from experiments indicating that a truncated version of mcg7 when 
expressed as a GST fursion protein can function as a Ras guanine nucleotide exchange factor. 
30 In brief, Ras (unprocessed) is loaded with 3 H-GDP then incubated in the presence of excess 
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cold GTP ± GSTmcg7. Full details of our assay can be found in Porfiri et al. J. Biol. Chem. 
269 , 22672-22677 (1994). 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
5 variations and modifications other than those specifically described. It is to be understood that 
the invention includes all such variations and modifications. The invention also includes all of 
the steps, features, compositions and compounds referred to or indicated in this specification, 
individually or collectively, and any and all combinations of any two or more of said steps or 
features. 



10 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: The Council of The Queensland Institute for Medical Research 

(ii) TITLE OF INVENTION: A NOVEL GENE AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: AUSTRALIAN PROVISIONAL 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: HUGHES, DR E JOHN L 
(C) REFERENCE/DOCKET NUMBER: EJH/AF 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 

(C) TELEX: AA31787 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 2188 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CG ATT TCA TTC CTC GCT CCC CAC AGG TCC CTC TCC CCA AAA TAT TCC 4 7 

lie Ser Phe Leu Ala Pro His Arg Ser Leu Ser Pro Lys Tyr Ser 
15 10 15 

CAT CTT GTC CTA GCC CAT CCC CCA GAC TAT CTC AAG GAC CAG CTG TCC 95 
His Leu Val Leu Ala His Pro Pro Asp Tyr Leu Lys Asp Gin Leu Ser 
20 25 30 

CCA CGC CCC CGA CCT CCA CTA GGC CTG TGC CAC CCG CTG CCT GCA GGA 14 3 

Pro Arg Pro Arg Pro Pro Leu Gly .Leu Cys His Pro Leu Pro Ala Gly 
35 40 45 

AGA CGC CCG GTC CCG GGC CGG GTT AGC CCC ATG GGA ACG CAG CGC CTG 191 
Arg Arg Pro Val Pro Gly Arg Val Ser Pro Met Gly Thr Gin Arg Leu 
50 55 60 

TGT GGC CGC GGG ACT CAA GGC TGG CCT GGC TCA AGT GAA CAG CAC GTC 23 9 

Cys Gly Arg Gly Thr Gin Gly Trp Pro Gly Ser Ser Glu Gin His Val 
65 70 75 

CAG GAG GCG ACC TCG TCC GCG GGT TTG CAT TCT GGG GTG GAC GAG CTG 287 
Gin Glu Ala Thr Ser Ser Ala Gly Leu His Ser Gly Val Asp Glu Leu 
80 85 90 95 

GGG GTT CGG TCC GAG CCC GGT GGG AGG CTC CCG GAG CGC AGC CTG GGC 335 
Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser Leu Gly 
100 105 110 

CCA GCC CAC CCC GCG CCG GCG GCC ATG GCA GGC ACC CTG GAC CTG GAC 3 83 

Pro Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp Leu Asp 
115 120 125 

AAG GGC TGC ACG GTG GAG GAG CTG CTC CGC GGG TGC ATC GAA GCC TTC 431 
Lys Gly Cys Thr Val Glu Glu Leu Leu Arg Gly Cys lie Glu Ala Phe 
130 135 140 

GAT GAC TCC GGG AAG GTG CGG GAC CCG CAG CTG GTG CGC ATG TTC CTC 479 
Asp Asp Ser Gly Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu 
145 150 155 

ATG ATG CAC CCC TGG TAC ATC CCC TCC TCT CAG CTG GCG GCC AAG CTG 527 
Met Met His Pro Trp Tyr lie Pro Ser Ser Gin Leu Ala Ala Lys Leu 
160 165 170 175 

CTC CAC ATC TAC CAA CAA TCC CGG AAG GAC AAC TCC AAT TCC CTG CAG 575 
Leu His lie Tyr Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin 
180 185 190 

GTG AAA ACG TGC CAC CTG GTC AGG TAC TGG ATC TCC GCC TTC CCA GCG 623 
Val Lys Thr Cys His Leu Val Arg Tyr Trp lie Ser Ala Phe Pro Ala 
195 200 205 

GAG TTT GAC TTG AAC CCG GAG TTG GCT GAG CAG ATC AAG GAG CTG AAG 671 
Glu Phe Asp Leu Asn Pro Glu Leu Ala Glu Gin lie Lys Glu Leu Lys 
210 215 220 
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GCT CTG CTA GAC CAA GAA GGG AAC CGA CGG CAC AGC AGC CTA ATC GAC 719 
Ala Leu Leu Asp Gin Glu Gly Asn Arg Arg His Ser Ser Leu lie Asp 
225 230 235 

ATA GAC AGC GTC CCT ACC TAC AAG TGG AAG CGG CAG GTG ACT CAG CGG 7 67 

lie Asp Ser Val Pro Thr Tyr Lys Trp Lys Arg Gin Val Thr Gin Arg 
240 245 250 255 

AAC CCT GTG GGA CAG AAA AAG CGC AAG ATG TCC CTG TTG TTT GAC CAC 815 
Asn Pro Val Gly Gin Lys Lys Arg Lys Met Ser Leu Leu Phe Asp His 
260 265 270 

CTG GAG CCC ATG GAG CTG GCG GAG CAT CTC ACC TAC TTG GAG TAT CGC 8 63 

Leu Glu Pro Met Glu Leu Ala Glu His Leu Thr Tyr Leu Glu Tyr Arg 
275 280 285 

TCC TTC TGC AAG ATC CTG TTT CAG GAC TAT CAC AGT TTC GTG ACT CAT 911 
Ser Phe Cys Lys lie Leu Phe Gin Asp Tyr His Ser Phe Val Thr His 
290 295 300 

GGC TGC ACT GTG GAC AAC CCC GTC CTG GAG CGG TTC ATC TCC CTC TTC 959 
Gly Cys Thr Val Asp Asn Pro Val Leu Glu Arg Phe lie Ser Leu Phe 
305 310 315 

AAC AGC GTC TCA CAG TGG GTG CAG CTC ATG ATC CTC AGC AAA CCC ACA 10 07 

Asn Ser Val Ser Gin Trp Val Gin Leu Met lie Leu Ser Lys Pro Thr 
320 325 330 335 

GCC CCG CAG CGG GCC CTG GTC ATC ACA CAC. TTT GTC CAC GTG GCG GAG 1055 
Ala Pro Gin Arg Ala Leu Val lie Thr His Phe Val His Val Ala Glu 
340 345 350 

AAG CTG CTA CAG CTG CAG AAC TTC AAC ACG CTG ATG GCA GTG GTC GGG 1103 
Lys Leu Leu Gin Leu Gin Asn Phe Asn Thr Leu Met Ala Val Val Gly 
355 360 365 

GGC CTG AGC CAC AGC TCC ATC TCC CGC CTC AAG GAG ACC CAC AGC CAC 1151 
Gly Leu Ser His Ser Ser lie Ser Arg Leu Lys Glu Thr His Ser His 
370 375 380 

GTT AGC CCT GAG ACC ATC AAG CTC TGG GAG GGT CTC ACG GAA CTA GTG 119 9 

Val Ser Pro Glu Thr lie Lys Leu Trp Glu Gly Leu Thr Glu Leu Val 
385 390 395 

ACG GCG ACA GGC AAC TAT GGC AAC TAC CGG CGT CGG CTG GCA GCC TGT 12 4 7 

Thr Ala Thr Gly Asn Tyr Gly Asn Tyr Arg Arg Arg Leu Ala Ala Cys 
400 405 410 415 

GTG GGC TTC CGC TTC CCG ATC CTG GGT GTG CAC CTC AAG GAC CTG GTG 12 95 

Val Gly Phe Arg Phe Pro lie Leu Gly Val His Leu Lys Asp Leu Val 
420 425 430 

GCC CTG CAG CTG GCA CTG CCT GAC TGG CTG GAC CCA GCC CGG ACC CGG 13 43 

Ala Leu Gin Leu Ala Leu Pro Asp Trp Leu Asp Pro Ala Arg Thr Arg 
435 440 445 

CTC AAC GGG GCC AAG ATG AAG CAG CTC TTT AGC ATC CTG GAG GAG CTG 13 91 

Leu Asn Gly Ala Lys Met Lys Gin Leu Phe Ser lie Leu Glu Glu Leu 
450 455 460 

GCC ATG GTG ACC AGC CTG CGG CCA CCA GTA CAG GCC AAC CCC GAC CTG 14 3 9 

Ala Met Val Thr Ser Leu Arg Pro Pro Val Gin Ala Asn Pro Asp Leu 
465 470 475 
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CTG AGC CTG CTC ACG GTG TCT CTG GAT CAG TAT CAG ACG GAG GAT GAG 14 87 

Leu Ser Leu Leu Thr Val Ser Leu Asp Gin Tyr Gin Thr Glu Asp Glu 
480 485 490 495 

CTG TAC CAG CTG TCC CTG CAG CGG GAG CCG CGC TCC AAG TCC TCG CCA 153 5 

Leu Tyr Gin Leu Ser Leu Gin Arg Glu Pro Arg Ser Lys Ser Ser Pro 
500 505 510 

ACC AGC CCC ACG AGT TGC ACC CCA CCA CCC CGG CCC CCG GTA CTG GAG 1583 
Thr Ser Pro Thr Ser Cys Thr Pro Pro Pro Arg Pro Pro Val Leu Glu 
515 520 525 

GAG TGG ACC TCG GCT GCC AAA CCC AAG CTG GAT CAG GCC CTC GTG GTG 1631 
Glu Trp Thr Ser Ala Ala Lys Pro Lys Leu Asp Gin Ala Leu Val Val 
530 535 540 

GAG CAC ATC GAG AAG ATG GTG GAG TCT GTG TTC CGG AAC TTT GAC GTC 167 9 

Glu His lie Glu Lys Met Val Glu Ser Val Phe Arg Asn Phe Asp Val 
545 550 555 

GAT GGG GAT GGC CAC ATC TCA CAG GAA GAA TTC CAG ATC ATC CGT GGG 1727 
Asp Gly Asp Gly His lie Ser Gin Glu Glu Phe Gin lie lie Arg Gly 
560 565 570 575 

AAC TTC CCT TAC CTC AGC GCC TTT GGG GAC CTC GAC CAG AAC CAG GAT 1775 
Asn Phe Pro Tyr Leu Ser Ala Phe Gly Asp Leu Asp Gin Asn Gin Asp 
580 585 590 

GGC TGC ATC AGC AGG GAG GAG ATG GTT TCC TAT TTC CTG CGC TCC AGC 18 2 3 

Gly Cys lie Ser Arg Glu Glu Met Val Ser Tyr Phe Leu Arg Ser Ser 
595 600 605 

TCT GTG TTG GGG GGG CGC ATG GGC TTC GTA CAC AAC TTC CAG GAG AGC 1871 
Ser Val Leu Gly Gly Arg Met Gly Phe Val His Asn Phe Gin Glu Ser 
610 615 620 

AAC TCC TTG CGC CCC GTC GCC TGC CGC CAC TGC AAA GCC CTG ATC CTG 1919 
Asn Ser Leu Arg Pro Val Ala Cys Arg His Cys Lys Ala Leu lie Leu 
625 630 635 

GGC ATC TAC AAG CAG GGC CTC AAA TGC CGA GCC TGT GGA GTG AAC TGC 19 67 

Gly lie Tyr Lys Gin Gly Leu Lys Cys Arg Ala Cys Gly Val Asn Cys 
640 645 650 655 

CAC AAG CAG TGC AAG GAT CGC CTG TCA GTT GAG TGT CGG CGC AGG GCC 2 015 

His Lys Gin Cys Lys Asp Arg Leu Ser Val Glu Cys Arg Arg Arg Ala 
660 665 670 

CAG AGT GTG AGC CTG GAG GGG TCT GCA CCC TCA CCC TCA CCC ATG CAC 2 063 

Gin Ser Val Ser Leu Glu Gly Ser Ala Pro Ser Pro Ser Pro Met His 
675 680 685 

AGC CAC CAT CAC CGC GCC TTC AGC TTC TCT CTG CCC CGC CCT GGC AGG 2111 
Ser His His His Arg Ala Phe Ser Phe Ser Leu Pro Arg Pro Gly Arg 
690 695 700 

CGA GGC TCC AGG CCT CCA GAG ATC CGT GAG GAG GAG GTA CAG ACG GTG 215 9 

Arg Gly Ser Arg Pro Pro Glu lie Arg Glu Glu Glu Val Gin Thr Val 
705 710 715 

GAG GAT GGG GTG TTT GAC ATC CAC TTG TA AT AG ATG CTG TGGTTGGATC 22 08 

Glu Asp Gly Val Phe Asp lie His Leu 
720 725 
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AAGGACTCAT TCCTGCCTTG GAGAAAATAC TTCAACCAGA GCAGGGAGCC TGGGGGTGTC 



2268 



GGGG CAGG AG GCTGGGGATG GGGGTGGGAT ATGAGGGTGG CATGCAGCTG AGGGCAGGGC 



2328 



CAGGGCTGGT GTCCCTAAGG TTGTACAGAC TCTTGTGAAT ATTTGTATTT TCCAGATGGA 



2388 



AT AAAAAGG C CCGTGTAATT AACCTTC 



2415 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

lie Ser Phe Leu Ala Pro His Arg Ser Leu Ser Pro Lys Tyr Ser His 
15 10 15 

Leu Val Leu Ala His Pro Pro Asp Tyr Leu Lys Asp Gin Leu Ser Pro 
20 25 30 

Arg Pro Arg Pro Pro Leu Gly Leu Cys His Pro Leu Pro Ala Gly Arg 
35 40 45 

Arg Pro Val Pro Gly Arg Val Ser Pro Met Gly Thr Gin Arg Leu Cys 
50 55 60 

Gly Arg Gly Thr Gin Gly Trp Pro Gly Ser Ser Glu Gin His Val Gin 
65 70 75 80 

Glu Ala Thr Ser Ser Ala Gly Leu His Ser Gly Val Asp Glu Leu Gly 
85 90 95 

Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser Leu Gly Pro 
100 105 110 

Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp Leu Asp Lys 
115 120 125 

Gly Cys Thr Val Glu Glu Leu Leu Arg Gly Cys lie Glu Ala Phe Asp 
130 135 140 

Asp Ser Gly Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu Met 
145 150 155 160 

Met His Pro Trp Tyr lie Pro Ser Ser Gin Leu Ala Ala Lys Leu Leu 
165 170 175 

His lie Tyr Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin Val 
180 185 190 

Lys Thr Cys His Leu Val Arg Tyr Trp lie Ser Ala Phe Pro Ala Glu 
195 200 205 

Phe Asp Leu Asn Pro Glu Leu Ala Glu Gin He Lys Glu Leu Lys Ala 
210 215 220 

Leu Leu Asp Gin Glu Gly Asn Arg Arg His Ser Ser Leu He Asp He 
225 230 235 240 
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Asp Ser Val Pro 



Pro Val Gly Gin 
260 



Glu Pro Met Glu 
275 

Phe Cys Lys lie 
290 

Cys Thr Val Asp 
305 

Ser Val Ser Gin 



Pro Gin Arg Ala 
340 

Leu Leu Gin Leu 
355 

Leu Ser His Ser 
370 

Ser Pro Glu Thr 
385 

Ala Thr Gly Asn 



Gly Phe Arg Phe 
420 

Leu Gin Leu Ala 
435 

Asn Gly Ala Lys 
450 

Met Val Thr Ser 
465 

Ser Leu Leu Thr 



Tyr Gin Leu Ser 
500 

Ser Pro Thr Ser 
515 

Trp Thr Ser Ala 
530 

His lie Glu Lys 
545 

Gly Asp Gly His 



Phe Pro Tyr Leu 
580 



Thr Tyr Lys Trp 
245 

Lys Lys Arg Lys 



Leu Ala Glu His 
280 

Leu Phe Gin Asp 
295 

Asn Pro Val Leu 
310 

Trp Val Gin Leu 
325 

Leu Val lie Thr 



Gin Asn Phe Asn 
360 

Ser lie Ser Arg 
375 

lie Lys Leu Trp 
390 

Tyr Gly Asn Tyr 
405 

Pro lie Leu Gly 



Leu Pro Asp Trp 
440 

Met Lys Gin Leu 
455 

Leu Arg Pro Pro 
470 

Val Ser Leu Asp 
485 

Leu Gin Arg Glu 



Cys Thr Pro Pro 
520 

Ala Lys Pro Lys 
535 

Met Val Glu Ser 
550 

lie Ser Gin Glu 
565 

Ser Ala Phe Gly 



Lys Arg Gin Val 
250 

Met Ser Leu Leu 
265 

Leu Thr Tyr Leu 



Tyr His Ser Phe 
300 

Glu Arg Phe lie 
315 

Met He Leu Ser 
330 

His Phe Val His 
345 

Thr Leu Met Ala 



Leu Lys Glu Thr 
380 

Glu Gly Leu Thr 
395 

Arg Arg Arg Leu 
410 

Val His Leu Lys 
425 

Leu Asp Pro Ala 



Phe Ser He Leu 
460 

Val Gin Ala Asn 
475 

Gin Tyr Gin Thr 
490 

Pro Arg Ser Lys 
505 

Pro Arg Pro Pro 



Leu Asp Gin Ala 
540 



Val Phe Arg Asn 
555 

Glu Phe Gin He 
570 

Asp Leu Asp Gin 
585 



Thr Gin Arg Asn 
255 

Phe Asp His Leu 
270 

Glu Tyr Arg Ser 
285 

Val Thr His Gly 



Ser Leu Phe Asn 
320 

Lys Pro Thr Ala 
335 

Val Ala Glu Lys 
350 

Val Val Gly Gly 
365 

His Ser His Val 



Glu Leu Val Thr 
400 

Ala Ala Cys Val 
415 

Asp Leu Val Ala 
430 

Arg Thr Arg Leu 
445 

Glu Glu Leu Ala 



Pro Asp Leu Leu 
480 

Glu Asp Glu Leu 
495 

Ser Ser Pro Thr 
510 

Val Leu Glu Glu 
525 

Leu Val Val Glu 



Phe Asp Val Asp 
560 

He Arg Gly Asn 
575 

Asn Gin Asp Gly 
590 
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Cys lie Ser Arg Glu 
595 

Val Leu Gly Gly Arg 
610 

Ser Leu Arg Pro Val 
625 

lie Tyr Lys Gin Gly 
645 

Lys Gin Cys Lys Asp 
660 

Ser Val Ser Leu Glu 
675 

His His His Arg Ala 
690 

Gly Ser Arg Pro Pro 
705 

Asp Gly Val Phe Asp 
725 



Glu Met Val Ser Tyr 
600 

Met Gly Phe Val His 
615 

Ala Cys Arg His Cys 
630 

Leu Lys Cys Arg Ala 
650 

Arg Leu Ser Val Glu 
665 

Gly Ser Ala Pro Ser 
680 

Phe Ser Phe Ser Leu 
695 

Glu lie Arg Glu Glu 
710 

lie His Leu 



Phe Leu Arg Ser Ser Ser 
605 

Asn Phe Gin Glu Ser Asn 
620 

Lys Ala Leu lie Leu Gly 
635 640 

Cys Gly Val Asn Cys His 
655 

Cys Arg Arg Arg Ala Gin 
670 

Pro Ser Pro Met His Ser 
685 

Pro Arg Pro Gly Arg Arg 
700 

Glu Val Gin Thr Val Glu 
715 720 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 170.. 300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CG AT T T CAT T CCTCGCTCCC CACAGGTCCC TCTCCCCAAA ATATTCCCAT CTTGTCCTAG 6 0 

CCCATCCCCC AGACTATCTC AAGGACCAGC TGTCCCCACG CCCCCGACCT CCACTAGGCC 12 0 

TGTGCCACCC GCTGCCTGCA GGAAGACGCC CGGTCCCGGG CCGGGTTAG CCC CAT 17 5 

Pro His 
1 

GGG AAC GGG GTT CGG TCC GAG CCC GGT GGG AGG CTC CCG GAG CGC AGC 2 23 

Gly Asn Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser 
5 10 15 

CTG GGC CCA GCC CAC CCC GCG CCG GCG GCC ATG GCA GGC ACC CTG GAC 271 
Leu Gly Pro Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp 
20 25 30 

CTG GAC AAG GGC TGC ACG GTG GAG GAG CT 3 00 

Leu Asp Lys Gly Cys Thr Val Glu Glu Leu 
35 40 
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(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Pro His Gly Asn Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu 
15 10 15 

Arg Ser Leu Gly Pro Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr 
20 . 25 30 

Leu Asp Leu Asp Lys Gly Cys Thr Val Glu Glu Leu 
35 40 

DATED this 23 rd day of May 1997 

The Council of The Queensland Institute for Medical Research 
By DAVIES COLLISON CAVE 
Patent Attorneys for the Applicants 



FIGURE 2 



CG ATT TCA TTC CTC GCT CCC CAC AGG TCC CTC TCC CCA AAA TAT TCC 4 7 

lie Ser Phe Leu Ala Pro His Arg Ser Leu Ser Pro Lys Tyr Ser 
15 10 15 

CAT CTT GTC CTA GCC CAT CCC CCA GAC TAT CTC AAG GAC CAG CTG TCC 95 

His Leu Val Leu Ala His Pro Pro Asp Tyr Leu Lys Asp Gin Leu Ser 

20 25 30 

CCA CGC CCC CGA CCT CCA CTA GGC CTG TGC CAC CCG CTG CCT GCA GGA 143 

Pro Arg Pro Arg Pro Pro Leu Gly Leu Cys His Pro Leu Pro Ala Gly 
35 40 45 

AGA CGC CCG GTC CCG GGC CGG GTT AGC CCC ATG GGA ACG CAG CGC CTG 191 

Arg Arg Pro Val Pro Gly Arg Val Ser Pro Met Gly Thr Gin Arg Leu 
50 55 60 

TGT GGC CGC GGG ACT CAA GGC TGG CCT GGC TCA AGT GAA CAG CAC GTC 23 9 

Cys Gly Arg Gly Thr Gin Gly Trp Pro Gly Ser Ser Glu Gin His Val 
65 70 73 

CAG GAG GCG ACC TCG TCC GCG GGT TTG CAT TCT GGG GTG GAC GAG CTG 2 87 

Gin Glu Ala Thr Ser Ser Ala Gly Leu His Ser Gly Val Asp Glu Leu 
80 85 90 95 

GGG GTT CGG TCC GAG CCC GGT GGG AGG CTC CCG GAG CGC AGC CTG GGC 33 5 

Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser Leu Gly 

100 105 110 

CCA GCC CAC CCC GCG CCG GCG GCC ATG GCA GGC ACC CTG GAC CTG GAC 3 83 

Pro Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp Leu Asp 
115 120 125 

AAG GGC TGC ACG GTG GAG GAG CTG CTC CGC GGG TGC ATC GAA GCC TTC 431 

Lys Gly Cys Thr Val Glu Glu Leu Leu Arg Gly Cys lie Glu Ala Phe 
130 135 140 

GAT GAC TCC GGG AAG GTG CGG GAC CCG CAG CTG GTG CGC ATG TTC CTC 47 9 

Asp Asp Ser Gly Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu 
145 150 155 

ATG ATG CAC CCC TGG TAC ATC CCC TCC TCT CAG CTG GCG GCC AAG CTG 52 7 

Met Met His Pro Trp Tyr lie Pro Ser Ser Gin Leu Ala Ala Lys Leu 
160 165 170 175 

CTC CAC ATC TAC CAA CAA TCC CGG AAG GAC AAC TCC AAT TCC CTG CAG 57 5 

Leu His lie Tyr Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin 

180 185 190 

GTG AAA ACG TGC CAC CTG GTC AGG TAC TGG ATC TCC GCC TTC CCA GCG 62 3 

Val Lys Thr Cys His Leu Val Arg Tyr Trp lie Ser Ala Phe Pro Ala 
195 200 205 

GAG TTT GAC TTG AAC CCG GAG TTG GCT GAG CAG ATC AAG GAG CTG AAG 671 
Figure 2 (continued) 

Glu Phe Asp Leu Asn Pro Glu Leu Ala Glu Gin lie Lys Glu Leu Lys 
210 215 220 

GCT CTG CTA GAC CAA GAA GGG AAC CGA CGG CAC AGC AGC CTA ATC GAC 719 

Ala Leu Leu Asp Gin Glu Gly Asn Arg Arg His Ser Ser Leu lie Asp 
225 230 235 



ATA GAC AGC GTC CCT ACC TAC AAG TGG AAG CGG CAG GTG ACT CAG CGG 767 
He Asp Ser Val Pro Thr Tyr Lys Trp Lys Arg Gin Val Thr Gin Arg 
240 245 250 255 

AAC CCT GTG GGA CAG AAA AAG CGC AAG ATG TCC CTG TTG TTT GAC CAC 815 
Asn Pro Val Gly Gin Lys Lys Arg Lys Met Ser Leu Leu Phe Asp His 
260 265 270 

CTG GAG CCC ATG GAG CTG GCG GAG CAT CTC ACC TAC TTG GAG TAT CGC 8 63 

Leu Glu Pro Met Glu Leu Ala Glu His Leu Thr Tyr Leu Glu Tyr Arg 
275 280 285 

TCC TTC TGC AAG ATC CTG TTT CAG GAC TAT CAC AGT TTC GTG ACT CAT 911 
Ser Phe Cys Lys He Leu Phe Gin Asp Tyr His Ser Phe Val Thr His 
290 295 300 

GGC TGC ACT GTG GAC AAC CCC GTC CTG GAG CGG TTC ATC TCC CTC TTC 959 
Gly Cys Thr Val Asp Asn Pro Val Leu Glu Arg Phe He Ser Leu Phe 
305 310 315 

AAC AGC GTC TCA CAG TGG GTG CAG CTC ATG ATC CTC AGC AAA CCC ACA 10 07 

Asn Ser Val Ser Gin Trp Val Gin Leu Met He Leu Ser Lys Pro Thr 
320 325 330 335 

GCC CCG CAG CGG GCC CTG GTC ATC ACA CAC TTT GTC CAC GTG GCG GAG 1055 
Ala Pro Gin Arg Ala Leu Val He Thr His Phe Val His Val Ala Glu 
340 345 350 

AAG CTG CTA CAG CTG CAG AAC TTC AAC ACG CTG ATG GCA GTG GTC GGG 1103 
Lys Leu Leu Gin Leu Gin Asn Phe Asn Thr Leu Met Ala Val Val Gly 
355 360 365 

GGC CTG AGC CAC AGC TCC ATC TCC CGC CTC AAG GAG ACC CAC AGC CAC 1151 
Gly Leu Ser His Ser Ser He Ser Arg Leu Lys Glu Thr His Ser His 
370 375 380 

GTT AGC CCT GAG ACC ATC AAG CTC TGG GAG GGT CTC ACG GAA CTA GTG 1199 
Val Ser Pro Glu Thr He Lys Leu Trp Glu Gly Leu Thr Glu Leu Val 
385 390 395 

ACG GCG ACA GGC AAC TAT GGC AAC TAC CGG CGT CGG CTG GCA GCC TGT 12 4 7 

Thr Ala Thr Gly Asn Tyr Gly Asn Tyr Arg Arg Arg Leu Ala Ala Cys 
400 • 405 410 415 

GTG GGC TTC CGC TTC CCG ATC CTG GGT GTG CAC CTC AAG GAC CTG GTG 12 95 

Val Gly Phe Arg Phe Pro He Leu Gly Val His Leu Lys Asp Leu Val 
420 425 430 

GCC CTG CAG CTG GCA CTG CCT GAC TGG CTG GAC CCA GCC CGG ACC CGG 13 4 3 

Ala Leu Gin Leu Ala Leu Pro Asp Trp Leu Asp Pro Ala Arg Thr Arg 

Figure 2 (continued) 

435 440 445 

CTC AAC GGG GCC AAG ATG AAG CAG CTC TTT AGC ATC CTG GAG GAG CTG 13 91 

Leu Asn Gly Ala Lys Met Lys Gin Leu Phe Ser He Leu Glu Glu Leu 
450 455 460 

GCC ATG GTG ACC AGC CTG CGG CCA CCA GTA CAG GCC AAC CCC GAC CTG 14 3 9 

Ala Met Val Thr Ser Leu Arg Pro Pro Val Gin Ala Asn Pro Asp Leu 
465 470 475 

CTG AGC CTG CTC ACG GTG TCT CTG GAT CAG TAT CAG ACG GAG GAT GAG 14 87 

Leu Ser Leu Leu Thr Val Ser Leu Asp Gin Tyr Gin Thr Glu Asp Glu 
480 485 490 495 

CTG TAC CAG CTG TCC CTG CAG CGG GAG CCG CGC TCC AAG TCC TCG CCA 153 5 

Leu Tyr Gin Leu Ser Leu Gin Arg Glu Pro Arg Ser Lys Ser Ser Pro 
500 505 510 



ACC AGC CCC ACG AGT TGC ACC CCA CCA CCC CGG CCC CCG GTA CTG GAG 158 3 

Thr Ser Pro Thr Ser Cys Thr Pro Pro Pro Arg Pro Pro Val Leu Glu 
515 520 525 

GAG TGG ACC TCG GCT GCC AAA CCC AAG CTG GAT CAG GCC CTC GTG GTG 1631 

Glu Trp Thr Ser Ala Ala Lys Pro Lys Leu Asp Gin Ala Leu Val Val 
530 535 540 

GAG CAC ATC GAG AAG ATG GTG GAG TCT GTG TTC CGG AAC TTT GAC GTC 167 9 

Glu His lie Glu Lys Met Val Glu Ser Val Phe Arg Asn Phe Asp Val 

545 550 555 

GAT GGG GAT GGC CAC ATC TCA CAG GAA GAA TTC CAG ATC ATC CGT GGG 172 7 

Asp Gly Asp Gly His lie Ser Gin Glu Glu Phe Gin lie lie Arg Gly 
560 565 570 575 

AAC TTC CCT TAC CTC AGC GCC TTT GGG GAC CTC GAC CAG AAC CAG GAT 177 5 

Asn Phe Pro Tyr Leu Ser Ala Phe Gly Asp Leu Asp Gin Asn Gin Asp 
580 585 590 

GGC TGC ATC AGC AGG GAG GAG ATG GTT TCC TAT TTC CTG CGC TCC AGC 18 23 

Gly Cys lie Ser Arg Glu Glu Met Val Ser Tyr Phe Leu Arg Ser Ser 
595 600 605 

TCT GTG TTG GGG GGG CGC ATG GGC TTC GTA CAC AAC TTC CAG GAG AGC 18 71 

Ser Val Leu Gly Gly Arg Met Gly Phe Val His Asn Phe Gin Glu Ser 
610 615 620 

AAC TCC TTG CGC CCC GTC GCC TGC CGC CAC TGC AAA GCC CTG ATC CTG 1919 

Asn Ser Leu Arg Pro Val Ala Cys Arg His Cys Lys Ala Leu lie Leu 

625 630 635 

GGC ATC TAC AAG CAG GGC CTC AAA TGC CGA GCC TGT GGA GTG AAC TGC 1967 

Gly lie Tyr Lys Gin Gly Leu Lys Cys Arg Ala Cys Gly Val Asn Cys 
640 645 650 655 

CAC AAG CAG TGC AAG GAT CGC CTG TCA GTT GAG TGT CGG CGC AGG GCC 2 015 

His Lys Gin Cys Lys Asp Arg Leu Ser Val Glu Cys Arg Arg Arg Ala 



Figure 2 (continued) 



660 665 670 

CAG AGT GTG AGC CTG GAG GGG TCT GCA CCC TCA CCC TCA CCC ATG CAC 2 0 63 

Gin Ser Val Ser Leu Glu Gly Ser Ala Pro Ser Pro Ser Pro Met His 
675 680 685 

AGC CAC CAT CAC CGC GCC TTC AGC TTC TCT CTG CCC CGC CCT GGC AGG 2111 
Ser His His His Arg Ala Phe Ser Phe Ser Leu Pro Arg Pro Gly Arg 
690 695 700 

CGA GGC TCC AGG CCT CCA GAG ATC CGT GAG GAG GAG GTA CAG ACG GTG 215 9 

Arg Gly Ser Arg Pro Pro Glu lie Arg Glu Glu Glu Val Gin Thr Val 
705 710 715 

GAG GAT GGG GTG TTT GAC ATC CAC TTG TA ATAGATGCTG TGGTTGGATC 2 2 08 
Glu Asp Gly Val Phe Asp He His Leu 
720 725 

AAGGACT CAT TCCTGCCTTG GAGAAAATAC TTCAACCAGA GCAGGGAGCC TGGGGGTGTC 22 68 

GGGG CAGG AG GCTGGGGATG GGGGTGGGAT ATGAGGGTGG CATGCAGCTG AGGGCAGGGC 232 8 

CAGGGCTGGT GTCCCTAAGG TTGTACAGAC TCTTGTGAAT ATTTGTATTT TCCAGATGGA 23 8 8 



AT AAAAAGG C CCGTGTAATT AACCTTCA 



2416 
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MCG7 - Cloning of a novel human gene that encodes a guanine exchange factor 



CGATTTCATTCCTCGCTCCCCACAGGTCCCTCTCCCCAAAATATTCCCATCTTGTCCTAG 6 0 
ISFLAPHRSLSPKYSHLVL 19 
CCCATCCCCCAGACTATCTCAAGGACCAGCTGTCCCCACGCCCCCGACCTCCACTAGGCC 12 0 
AHPPDYLKDQLSPRPRPPLG 39 
TGTGCCACCCGCTGCCTGCAGGAAGACGCCCGGTCCCGGGCCGGGTTAGCCCCATGGGAA 18 0 
LCHPLPAGRRPVPGRVSPMG 59 
CGcagcgcctgtgtggccgcgggactcaaggctggcctggctcaagtgaacagcacgtcc 24 0 
TQRLCGRGTQGWPGS SEQHV 79 
aggaggcgacctcgtccgcgggtttgcattctggggtggacgagctggGGGTTCGGTCCG 3 00 
QEATS SAGLHSGVDELGVRS 99 
AGCCCGGTGGGAGGCTCCCGGAGCGCAGCCTGGGCCCAGCCCACCCCGCGCCGGCGGCCA 36 0 
EPGGRLPERSLGPAHPAPAA 119 
TGGCAGGCACCCTGGACCTGGACAAGGGCTGCACGGTGGAGGAGCTGCTCCGCGGGTGCA 4 2 0 
MAGTLDLDKGCT VEE LLRGC 139 
TCGAAGCCTTCGATGACTCCGGGAAGGTGCGGGACCCGCAGCTGGTGCGCATGTTCCTCA 4 8 0 
I EAFDDSGKVRDPQLVRMFL 159 
TGATGCACCCCTGGTACATCCCCTCCTCTCAGCTGGCGGCCAAGCTGCTCCACATCTACC 54 0 
MMHPWY I PS SQLAAKLLH IY 179 
AACAATCCCGGAAGGACAACTCCAATTCCCTGCAGGTGAAAACGTGCCACCTGGTCAGGT 6 0 0 
QQS RKDNSNS LQVKTCHLVR 199 
ACTGGATCTCCGCCTTCCCAGCGGAGTTTGACTTGAACCCGGAGTTGGCTGAGCAGATCA 6 6 0 
YW ISAFPAE FDLNPELAEQI 219 
AGGAGCTGAAGGCTCTGCTAGACCAAGAAGGGAACCGACGGCACAGCAGCCTAATCGACA 7 2 0 
KELKALLDQEGNRRHSSLID 239 
TAGACAGCGTCCCTACCTACAAGTGGAAGCGGCAGGTGACTCAGCGGAACCCTGTGGGAC 7 8 0 
IDSVPTYKWKRQVTQRNPVG 259 
AGAAAAAGCGCAAGATGTCCCTGTTGTTTGACCACCTGGAGCCCATGGAGCTGGCGGAGC 8 4 0 
QKKRKMSLLFDHLEPMELAE 279 
ATCTCACCTACTTGGAGTATCGCTCCTTCTGCAAGATCCTGTTTCAGGACTATCACAGTT 9 0 0 
HLTYLEYRS FCKILFQDYHS 299 
TCGTGACTCATGGCTGCACTGTGG ACAACCCCGTCCTGGAGCGGTTCATCTCCCTCTTCA 96 0 
FVTHGCTVDNPVLERFI SLF 319 
ACAGCGTCTCACAGTGGGTGCAGCTCATGATCCTCAGCAAACCCACAGCCCCGCAGCGGG 102 0 
NSVSQWVQLM ILSKPTAPQR 339 
CCCTGGTCATCACACACTTTGTCCACGTGGCGGAGAAGCTGCTACAGCTGCAGAACTTCA 10 80 
ALVI THFVHVAEKLLQLQNF 359 
ACACGCTGATGGCAGTGGTCGGGGGCCTGAGCCACAGC £ C CAT C T C C CGC CT C AAGG AG A 114 0 
NTLMAVVGGLSHSS I SRLKE 379 
CCCACAGCCACGTTAGCCCTGAG ACC ATCAAGCTCTGGGAGGGTCTCACGGAACTAGTGA 12 00 
THSHVS PET I KLWEGLTELV 399 
CGGCGACAGGCAACTATGGCAACTACCGGCGTCGGCTGGCAGCCTGTGTGGGCTTCCGCT 12 6 0 
TATGNYGNYRRRLAACVGFR 419 
TCCCGATCCTGGGTGTGCACCTCAAGGACCTGGTGGCCCTGCAGCTGGCACTGCCTGACT 13 2 0 
FP ILGVHLKDLVALQLALPD 439 
GGCTGGACCCAGCCCGGACCCGGCTCAACGGGGCCAAGATGAAGCAGCTCTTTAGCATCC 13 8 0 
W L D P A RTRLNGAKMKQLFS I 459 
TGGAGGAGCTGGCCATGGTGACCAGCCTGCGGCCACCAGTACAGGCCAACCCCGACCTGC 14 4 0 
LEELAMVTS LRPPVQANPDL 479 
TGAGCCTGCTCACGGTGTCTCTGG ATCAGTATCAGACGGAGGATGAGCTGTACCAGCTGT 1500 
LS LLTVS LD QYQTEDELYQL 499 
CCCTGCAGCGGGAGCCGCGCTCCAAGTCCTCGCCAACCAGCCCCACGAGTTGCACCCCAC 156 0 
SLQREPRSKSSPTSPTSCTP 519 
CACCCCGGCCCCCGGTACTGGAGGAGTGGACCTCGGCTGCCAAACCCAAGCTGGATCAGG 1620 
PPRPPVLEEWTSAAKPKLDQ 539 
CCCTCGTGGTGGAGCACATCGAGAAGATGGTGGAGTCTGTGTTCCGGAACTTTGACGTCG 16 80 
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ALVVEHIEKMVESVFRNFDV 559 
ATGGGGATGGCCACATCTCACAGGAAGAATTCCAGATCATCCGTGGGAACTTCCCTTACC 174 0 
DGDGHISQEEFQI IRGNFPY 579 

TCAGCGCCTTTGGGGACCTCGACCAGAACCAGGATGGCTGCATCAGCAGGGAGGAGATGG 1800 

LSAFGDLDQNQDGCISREEM 599 

TTTCCTATTTCCTGCGCTCCAGCTCTGTGTTGGGGGGGCGCATGGGCTTCGTACACAACT 186 0 

VSYFLRSSSVLGGRMGFVHN 619 

TCCAGGAGAGCAACTCCTTGCGCCCCGTCGCCTGCCGCCACTGCAAAGCCCTGATCCTGG 1920 

FQES NSLRPVACRHCKALIL 639 

GCATCTACAAGCAGGGCCTCAAATGCCGAGCCTGTGGAGTGAACTGCCACAAGCAGTGCA 1980 

GIYKQGLKCRACGVNCHKQC 659 

• AGGATCGCCTGTCAGTTGAGTGTCGGCGCAGGGCCCAGAGTGTGAGCCTGGAGGGGTCTG 2 04 0 

KDRLSVECRRRAQSVSLEGS 679 

CACCCTCACCCTCACCCATGCACAGCCACCATCACCGCGCCTTCAGCTTCTCTCTGCCCC 2100 

APSPSPMHSH HHRAFS FSLP 699 

GCCCTGGCAGGCGAGGCTCCAGGCCTCCAGAGATCCGTGAGGAGGAGGTACAGACGGTGG 2160 

RPGRRGSRPPEIREEEVQTV 719 

AGGATGGGGTGTTTGACATCCACTTGTAATAGATGCTGTGGTTGGATCAAGGACTCATTC 222 0 
EDGVFDIHL* 72 q- 

CTGCCTTGGAGAAAATACTTCAACCAGAGCAGGGAGCCTGGGGGTGTCGGGGCAGGAGGC 22 8 0 
TGGGGATGGGGGTGGGATATGAGGGTGGCATGCAGCTGAGGGCAGGGCCAGGGCTGGTGT 2 34 0 
CCCTAAGGTTGTACAGACTCTTGTGAATATTTGTATTTTCCAGATGGAATAAAAAGGCCC 24 00 
GTGTAATTAACCTTC (A) , 
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Figure 2b 



CGATTTCATTCCTCGCTCCCCACAGGTCCCTCTCCCCAAAATATTCCCATCTTGTCCTAG 
CCCATCCCCCAGACTATCTCAAGGACCAGCTGTCCCCACGCCCCCGACCTCCACTAGGCC 
TGTGCCACCCGCTGCCTGCAGGAAGACGCCCGGTCCCGGGCCGGGTTAGCCCCATGGGAA 

+ p h g n 

CGGGGTTCGGTCCGAGCCCGGTGGGAGGCTCCCGGAGCGCAGCCTGGGCCCAGCCCACCC 
gvrsepggrlperslgpahp 

CGCGCCGGCGGCCATGGCAGGCACCCTGGACCTGGACAAGGGCTGCACGGTGGAGGAGCT 
a paaMAGTLDLDKGCTVE EL 
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human MAGTLDLD KGCTVEELLRGC I EAFDDS — OKV^PQLVKMFLblMHFWyiPSSQLiAAK 

nematode MSSKVKEDQHQEL^TEDQLVAR(^^ 
* * * . . * . 

human I^IYQQSTUO^SWSLQVKTCHLVRYWI SAFPAEFDLNPELAEQrKELKAT J .PQEGNRRH 

nematode FVNFYQETRNVEQ REAVCRAVSFWIEKFFMHFI^QPQVCAQWRLKT IA- EDINENI 

** * * * ** ** ** * * ** * 



human 
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******** * * 
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Figure 4 



human CGATTTCATT CCTCGCTCCC CACAGGTCCC TCTCCCCAAA ATATTCCCAT 

CTTGTCCTAG 6 0 ^„»~~n, 

human CCCATCCCCC AGACTATCTC AAGGACCAGC TGTCCCCACG CCCCCGACCT 

CCACTAGGCC 120 ^^^.^ 

human TGTGCCACCC GCTGCCTGCA GGAAGACGCC CGGTCCCGGG CCGGGTTAGC 

CCCATGGGAA 180 

human CGCAGCGCCT GTGTGGCCGC GGG ACT C AAG GCTGGCCTGG CTCAAGTGAA 

CAGCACGTCC 240 
mouse 



***tcag** ****ag**** t********* 



human 9 t> AGGAGG CGAC CTCGTCCGCG GGTTTGCATT CTGGGGTGGA CGAGCTGGGG 



GTTCGGTCCG 3 00 



acagg 
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mouse g***** t ** a **-*catt** ********** ***aa**aa* g**ct***** 

* *a**aatz* * > 

human AGCCCGGTGG GAGGCTCCCG GAGCGCAGCC TGGGCCCAGC CCACCCCGCG 

CCGGCGGCCA 360 

mouse *** a *t*"* **"***tga ***t*t*a*t ****t*t*** ***-*tg**a 

human 3 > TGGCAGGCAC CCTGGACCTG GACAAGGGCT GCACGGTGGA GGAGCTGCTC 

CGCGGGTGCA 4 20 
mouse **** ga **** t 1 

human"" C ^ TCGAAGCCTT CGATGACTCC GGGAAGGTGC GGGACCCGCA GCTGGTGCGC 
ATGTTCCTCA 4 80 

mouse ********** t ******** c ** a ******* * a **f*a** *** a ****** 

*****^****> 

human TGATGCACCC CTGGTACATC CCCTCCTCTC AGCTGGCGGC CAAGCTGCTC 

CACATCTACC 54 0 

mouse ******* *a ** t ******* *******tt* g**a****** 

♦**t****t*> 

human AACAATCCCG GAAGGACAAC TCCAATTCCC TGCAGGTGAA AACGTGCCAC 

CTGGTCAGGT 6 00 

mouse * g ******** ********** ********t 

AC 

GAG GAG AT CA 660 
mouse ** 

* * a * 



c ********* ******** t * **** C ***** *********i 



-***-,***♦ ******£*** 

■m g wwwwwwww -------- - - i_ CX w 

£*********> 

human ACTGGATCTC CGCCTTCCCA GCGGAGTTTG ACTTGAACCC GGAGTTGGCT 

a ********* ** a ***** c * ********** a***c***** 



human AGGAGCTGAA GGCTCTGCTA G A C CAAG A AG GGAACCGACG GCACAGCAGC 

CTAATCGACA 7 2 0 

_ ********** *******r** ********** *******ca* **♦*++***' 

mou se 



* * c * ** * ♦ ** > 

human TAGACAGCGT 
730 

mouse * c ** g ** t ** 
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CACGCCTCGGAAGGGAGGTTTGGGGTCGGTGGTTTCACAGTGAGTGTGTCTGAAGCCAAA 6 0 
TGGTCGGAAACCGTTACCCGCTCTCCTAGGCCCGGCTAGTGGGGACCCCAACCGCCTGCG 12 0 

*ARLVGTPTAC> 
GCTGCCCCTCCCAAGTTCCTCCCTGTTGGCCAGGCATCCAGGTCTCCAGTCTCCGAGCTG 180 
GCPSQVPPCWPGIQVSSLRA> 
CGGAGAACCCACCGCCACATGCGGCTGCCCCTTTCCATTCGACCCTGTGGGGAGCCAGGC 2 4 0 
AENPPPHAAAPFHSTLWGA R> 
TTCCGGGGCCCCGTTCCTCCTGTGTGAACTGGGCCCCCCGCCCCCATTCCCAGACATCAA 3 0 0 
LPG PRSSCVNWAPRPHSQTS> 
GGCCGCGTCTCCAGATAGCCACGATTTCATTCCTCGCTCCCCACAGGTCCCTCTCCCCAA 36 0 
RPRLQIATISFLAPHRSLSP> 
AATATTCCCATCTTGTCCTAGCCCATCC-CCAGACTATCTCAAGGACCAGCTGTCCCCAC 42 0 
KYSHLVLAHPPDYLKDQLSP> 
GCCCCCGACCTCCACTAGGCCTGTGCCACCCGCTGCCTGCAGGAAGACGCCCGGTCCCGG 480 
RPRPPLGLCHPLPAGRRPVP> 
GCCGGGTTAGCCCCATGGGAACGcagcgcctgtgtggccgcgggac tcaaggc tggcctg 54 0 
* p h g n 

GRVS PMGTQRLCGRGTQGW P> 
gctcaagtgaacagcacgcccaggaggcgacctcgtccgcgggtttgcattctggggtgg 600 
GSSEQHVQEATSSAGLHSG V> 
acgagctggGGGTTCGGTCCGAGCCCGGTGGGAGGCTCCCGGAGCGCAGCCTGGGCCCAG 660 
DELGVRSEPGGRLPERSLGP> 
CCCACCCCGCGCCGGCGGCCATGGCAGGCACCCTGGACCTGGACAAGGGCTGCACGGTGG 72 0 
AHPAPAAMAGTLDLDKGCT V> 
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